
    A frequent acoustic sign of speech motor delay (SMD)

    Recent studies report prevalence, phenotype, and persistence findings for a paediatric motor speech disorder in addition to childhood dysarthria and childhood apraxia of speech termed Speech Motor Delay (SMD). The aim of the present study was to determine if there is a frequent acoustic sign of SMD, with implications for theory, assessment, and treatment. We examined the frequency of 19 acoustic signs of SMD in audio recordings of continuous speech and word-imitation tasks in three groups of speakers with SMD: 50 children (mean age 5.1 years) with idiopathic Speech Delay (SD) from 6 USA cities; 87 children, adolescents, and adults with eight types of complex neurodevelopmental disorders; and 9 children (mean age 8.8 years) with persistent idiopathic SD from a population-based study of children in the South West of England. The 19 acoustic signs of imprecise or unstable speech, prosody, and voice were standardized on typical speakers of the appropriate dialect. The criterion for a frequent acoustic sign was that it occurred in at least 50% of participants with SMD in each of the three groups. Findings indicated that lengthened mid-vowels and diphthongs was the one sign that met criteria, occurring in 64.4% of the 146 participants with SMD, including 71% of the 87 participants with complex neurodevelopmental disorders. Findings are interpreted to support the potential of this acoustic sign, and possibly several others associated with temporal dimensions of speech sound development, to inform explication of the neuromotor substrates of SMD.
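The "frequent sign" criterion in the abstract above (a sign must occur in at least 50% of participants in each of the three groups) can be illustrated with a minimal sketch. The group sizes (50, 87, 9) come from the abstract; the sign names, per-group counts, and the function name `frequent_signs` are hypothetical and serve only to show the rule, not to reproduce the study's data or analysis.

```python
def frequent_signs(counts, group_sizes, threshold=0.5):
    """Return the signs whose occurrence rate meets `threshold` in every group."""
    frequent = []
    for sign, per_group in counts.items():
        # The criterion must hold in each group separately, not just overall.
        if all(per_group[g] / group_sizes[g] >= threshold for g in group_sizes):
            frequent.append(sign)
    return frequent


# Group sizes as reported in the abstract.
group_sizes = {"idiopathic_sd": 50, "complex_ndd": 87, "persistent_sd": 9}

# Hypothetical counts of participants showing each sign (illustration only).
counts = {
    "sign_a": {"idiopathic_sd": 30, "complex_ndd": 60, "persistent_sd": 5},
    "sign_b": {"idiopathic_sd": 40, "complex_ndd": 30, "persistent_sd": 8},
}

print(frequent_signs(counts, group_sizes))  # ['sign_a']
```

In this illustration `sign_b` is common in two groups but falls below 50% in the third, so it fails the per-group criterion even though its pooled rate exceeds 50%.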

    Responding to the Challenges and Opportunities of Workforce 2000

    We report results of a national study examining the impact of demographic changes in the American workforce on small business management practices. Telephone interviews with a national random sample of 94 small business owners explored (a) whether small business owners are aware of changing workforce demographics, and (b) whether these small business owners are proactively responding to these changes by modifying their personnel practices. Findings indicate that while small business managers are aware of changing workforce demographics, only a minority have changed their practices to take advantage of the new population available to them.

    Cepstral trajectories in linguistic units for text-independent speaker recognition

    The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-35292-8_3. Proceedings of IberSPEECH, held in Madrid (Spain) in 2012. In this paper, the contributions of different linguistic units to the speaker recognition task are explored by means of temporal trajectories of their MFCC features. Inspired by successful work in forensic speaker identification, we extend the approach based on temporal contours of formant frequencies in linguistic units to design a fully automatic system that brings together the forensic and automatic speaker recognition worlds. The combination of MFCC features and unit-dependent trajectories provides a powerful tool to extract individualizing information. At a fine-grained level, we provide a calibrated likelihood ratio per linguistic unit under analysis (extremely useful in applications such as forensics), and at a coarse-grained level, we combine the individual contributions of the different units to obtain a highly discriminative single system. This approach has been tested with NIST SRE 2006 datasets and protocols, consisting of 9,720 trials from 219 male speakers for the 1side-1side English-only task, with development data extracted from 367 male speakers from 1,808 conversations from NIST SRE 2004 and 2005 datasets. Supported by MEC grant PR-2010-123, MICINN project TEC09-14179, ForBayes project CCG10-UAM/TIC-5792 and Cátedra UAM-Telefónica.

    Acoustic Correlates of Information Structure.

    This paper reports three studies aimed at addressing three questions about the acoustic correlates of information structure in English: (1) do speakers mark information structure prosodically; (2) to the extent they do, what are the acoustic features associated with different aspects of information structure; and (3) how well can listeners retrieve this information from the signal? The information structure of subject-verb-object sentences was manipulated via the questions preceding those sentences: elements in the target sentences were either focused (i.e., the answer to a wh-question) or given (i.e., mentioned in prior discourse); furthermore, focused elements had either an implicit or an explicit contrast set in the discourse; finally, either only the object was focused (narrow object focus) or the entire event was focused (wide focus). The results across all three experiments demonstrated that people reliably mark (1) focus location (subject, verb, or object) using greater intensity, longer duration, and higher mean and maximum F0, and (2) focus breadth, such that narrow object focus is marked with greater intensity, longer duration, and higher mean and maximum F0 on the object than wide focus. Furthermore, when participants are made aware of prosodic ambiguity present across different information structures, they reliably mark focus type, so that contrastively focused elements are produced with greater intensity, longer duration, and lower mean and maximum F0 than noncontrastively focused elements. In addition to having important theoretical consequences for accounts of semantics and prosody, these experiments demonstrate that linear residualisation successfully removes individual differences in people's productions, thereby revealing cross-speaker generalisations. Furthermore, discriminant modelling allows us to objectively determine the acoustic features that underlie meaning differences.

    Filled pauses in Hungarian: Their phonetic form and function

    Filled pauses are natural occurrences in spontaneous speech and they may turn up at any level of the speech planning process and in a number of functions. The aim of this paper is to find out whether the diverse functions of filled pauses correlate with diverse articulations resulting in diverse acoustic structures. Spontaneous narratives are used as research material. The duration of the filled pauses and the frequency values of their first two formants are analyzed. The most frequent form, schwa, shows function-dependent realizations as confirmed by the durational values and by the second formant values of these vowel-like sounds.